Integrating Pattern Growth Mining on SQL-Server RDBMS

نویسنده

  • Ronnie Alves
چکیده

Most of the FIMI implementations are memory-like. However when we deal with data warehouse the size of datasets are extremely huge for memory copy. However, using a pure SQL-like approach is inefficient, even almost implementations rarely take advantages of SQL-Extensions such as SQL-Cursors, Store Procedures and UDF functions. Furthermore, RDBMS vendors offer a lot of SQL-Extensions for taking control and management of the data. We propose a pattern growth mining approach tightly coupled on RDBMS for finding all frequent itemsets. The main idea is to avoid one-at-atime record retrieval from the database, saving both the coping and process context switching, the candidate set generation test (expensive joins), and table reconstruction. The experimental results obtained show that our approach is competitive with the most known SQL-like approaches. Our performance evaluation was made with SQL Server 2000 (v.8) and T-SQL.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SQL Based Association Rule Mining using Commercial RDBMS (IBM DB2 UDB EEE)

Data mining is becoming increasingly important since the size of databases grows even larger and the need to explore hidden rules from the databases becomes widely recognized. Currently database systems are dominated by relational database and the ability to perform data mining using standard SQL queries will definitely ease implementation of data mining. However the performance of SQL based da...

متن کامل

SQL Based Association Rule Mining Using Commercial RDBMS (IBM DB2 UBD EEE)

Data mining is becoming increasingly important since the size of databases grows even larger and the need to explore hidden rules from the databases becomes widely recognized. Currently database systems are dominated by relational database and the ability to perform data mining using standard SQL queries will definitely ease implementation of data mining. However the performance of SQL based da...

متن کامل

Improving Analysis Of Data Mining By Creating Dataset Using Sql Aggregations

In Data mining, an important goal is to generate efficient data. Efficiency and scalability have always been important con-cerns in the field of data mining. The increased complexity of the task calls for algorithms that are inherently more expensive. To analyze data efficiently, Data mining systems are widely using datasets with columns in horizontal tabular layout. Preparing a data set is mor...

متن کامل

A Better Approach for Horizontal Aggregations in SQL Using Data Sets for Data Mining Analysis

To analyzing the data efficiently in Data mining systems are widely using datasets with columns in horizontal tabular layout. Generally preparing a data set is the more complex task in a data mining project, require many complex SQL queries, aggregating columns and joining tables. Conventional RDBMS usually manage tables with vertical form. Aggregated columns in a horizontal tabular layout retu...

متن کامل

Optimization of Horizontal Aggregation in SQL by Using K-Means Clustering

To analyze data efficiently, Data mining systems are widely using datasets with columns in horizontal tabular layout. Preparing a data set is more complex task in a data mining project, requires many SQL queries, joining tables and aggregating columns. Conventional RDBMS usually manage tables with vertical form. Aggregated columns in a horizontal tabular layout returns set of numbers, instead o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005